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MODERATE DEVIATION PRINCIPLE FOR ERGODIC 
MARKOV CHAIN. LIPSCHITZ SUMMANDS 

B. DELYON, A. JUDITSKY, AND R. LIPTSER 


Abstract. For 1 < a < 1, we propose the MDP analysis for family 

1 " 

= — T.HiXi-i), n>l, 
n a z ' 
i =1 

where (U) n >o be a homogeneous ergodic Markov chain, X n £ R d , 
when the spectrum of operator P x is continuous. The vector-valued 
function H is not assumed to be bounded but the Lipschitz continuity 
of H is required. The main helpful tools in our approach are Poisson’s 
equation and Stochastic Exponential; the first enables to replace the 
original family by A-M„ with a martingale M n while the second to 
avoid the direct Laplace transform analysis. 


1. Introduction and discussion 

Let (V n ) n >o be a homogeneous ergodic Markov chain, X n £ M. d with the 
transition probability kernel for n steps: P x ^ = P( n \x,dy) (for brevity 
P^ := P x ) and the unique invariant measure pi. 

Let H be a measurable function R“ —> Mfi with f Rd \P[{z)\p{dz) < oo and 


H(z)p(dz) = 0. 


( 1 . 1 ) 


nul 


Set 

1 n 

S n = ~ X^-l)’ (°- 5 <«<!)• 

i=l 

In this paper, we examine the moderate deviation principle (in short: 
MDP) for the family (5") n >i when the spectrum of operator P x is contin¬ 
uous. 

It is well known that for bounded H satisfying (11.11) ((H) - condition), the 
most MDP compatible Markov chains are characterized by eigenvalues gap 
condition (EG) (see Wu, [TH] , FT . Gong and Wu, [Jj, and citations therein): 

the unit is an isolated, simple and the only eigenvalue with 
modulus 1 of the transition probability kernel P x . 

In the framework of (H)-(EG) conditions, the MDP is valid with the rate of 
speed n - i 2a- F anc i the rate function I(y),y £ M. d 


i{y) 


\\\y\\ 2 B ®, B®By = y 
oo, otherwise, 


( 1 . 2 ) 


I (x)2 
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where B ® is the pseudoinverse matrix (in Moore-Penrose sense, see e.g.|X|) 
for the matrix 


B= H(x)H*(x)fi(dx) 

J 

+ E/ \H(x)(P^H)* + (P^H)H*(x)]^dx) (1.3) 


n> 1 ’ 


(henceforth, *, | • |, and || • ||q are the transposition symbol, L 1 norm and L 2 
norm with the kernel Q (||:c||q = \J(x, Qx)) respectively). 

Thanks to the quadratic form rate function, the MDP is an attractive 
tool for an asymptotic analysis in many areas, say, with thesis (see, example 
in Section 0 

“MDP instead of CLT”. 

In this paper, we intend to apply the MDP analysis to Markov chain 
defined by the recurrent equation 


X n = f{X n - 1 ,£„), n > 1 


generated by i.i.d. sequence (£ n )n>i °f random vectors, where / is some 
vector-valued measurable function. Obviously, the function / and the dis¬ 
tribution of £i might be specified in this way P x satisfies (EG). For instance, 
if d = 1 and 

X n = /(*„_!)+&,, 

then for bounded / and Laplacian random variable (EG) holds. How¬ 
ever, (EG) fails for many useful in applications ergodic Markov chains. For 
d = 1, a typical example gives Gaussian Markov chain defined by a linear 
recurrent equation governed by i.i.d. sequence of (0,1)-Gaussian random 
variables (here |a| < 1) 

X n — ClX n —i + £ n . 

In order to clarify this remark, notice that if (EG) holds true, than for 
any bounded and measurable function H , satisfying (H)-property, for some 
constants K > 0, g £ (0,1), n > 1, 


\E x H(X n )\ < Kg n . (1.4) 

However, the latter fails for H{x ) = sign(x) satisfying (11.11) . In fact, if m 
were correct, then \E X H(X n )\ < On the other hand, it is readily 

to compute that Y^= o \E x H(X n )\ grows in |x| on the set {|x| > 1} faster 
than 0(log(|x|). 


<const 


In this paper, we avoid a verification of (EG). Although our approach is 
close to a conception of “Multiplicative Ergodicity” (see Balaji and Myen j^) 
and “Geometrical Ergodicity” (see Kontoyiannis and Meyn, [S] and Meyn 
and Tweedie, nil), Chen and Guillin, [1]) we do not follow explicitly these 
methodologies. 

Our main tools are the Poisson equation and the Puhalskii theorem from 
m- The Poisson equation enables to reduce the MDP verification for 
(5") n >i to (iM n )„>i, where M n is a martingale generated by Markov 
chain, while the Puhalskii theorem allows to replace an asymptotic analysis 
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sec+2 


N.l 


L. 2 


theo-4.1 


for the Laplace transform of M n by the asymptotic analysis for, so called, 
Stochastic Exponential 


<? n (\) = l[E 

i= 1 


exp 


( a -^« 


Mi_!) 


■W_1 


A G 


(1.5) 


being the product of the conditional Laplace transforms for martingale in¬ 
crements. 

An effectiveness of the Poisson equation approach (method of corrector) 
combined with the stochastic exponential is well known from the proofs 
of functional central limit theorem (FCLT) for the family (S'((' 5 ) n >i (see, 
e.g. Papanicolaou, Stroock and Varadhan DEI, Ethier and Kurtz |G|, Bhat - 
tacharya [3], Pardoux and Veretennikov m ; related topics can be found in 
Metivier and Priouret (80’s) for stochastic algorithms analysis. The use of 
the same approach for a continuous time setting can be found e.g. in 


2. Formulation of main result 

We consider Markov chain (X n ) n >o, X n G defined by a nonlinear 
recurrent equation 

X n = f(X n -!,S n ), (2.1) 

where / = f(z,v ) is a vector function with entries fi(z, v ),..., fd(z, v), 
u G »Gl p and (£ n ) n >i is i.i.d. sequence of random vectors of the size 

p. 

We fix the following assumptions. 

Assumption 2.1. Entries of f are Lipschitz continuous functions in the 
following sense: for any v 

\fi(zi .. ■ ...,z d ,vi,.. .,v p ) 

- fi(zi . . . , Zj-I, z", Z j+ 1 ...,Z d ,Vi,...,V p )\ 

< Qij \ Zj - z" |, 

I f(z',v) - f(z",v)\ < q\z' - z" I, 


where 

max = q < 1. 
i,j 

Assumption 2.2. For sufficiently small positive 5, Cramer’s condition holds: 

Ee 5 ^ < oo. 


Theorem 2.1. Under Assumptions [^3 and, I AM the Markov chain is er- 
godic with the invariant measure fi such that f Rd \z\fi(dz) < oo. For any 
Lipschitz continuous function H, satisfying (11.11) . the family (5 , “) n >i obeys 
the MDP in the metric space (M rf , r) (r is the Euclidean metric ) with the 
rate of speed n~^ 2a ^ 1 ' 1 and the rate function given in ca¬ 


st ex 


4.1a 
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Remark 1. Notice that: 

- assumptions of Theorem \2.1\ do not guarantee (EG), 

- Lipschitz continuous H, obeying the linear growth condition, are permis¬ 
sible for the MDP analysis, 

- the £ 1 - distribution with a continuous component is not required. 


Consider now a linear version of (HU): 

X n = AX n —i + £ n , 

where A is the d x d-matrix with entries Aij. Now, Assumption 12. Il reads as: 
rnaxjj \Aij\ < 1. This assumption is too restrictive. We replace it by more 
natural one 


L.l 


th.eo-1 


Assumption 2.3. The eigenvalues of A lie within the unit circle. 

Theorem 2.2. Under Assumvtion \2.tA the Markov chain is ergodic with the 
invariant measure p such that f Rd ||z|| 2 /r(cfe) < oo. For any Lipschitz con¬ 
tinuous function H , satisfying (ll.il) . the family (S'") n >i obeys the MDP in 
the metric space (M rf ,r) with the rate of speed and the rate function 

given in a. 


sec+3 
sec+3.1 


3. Preliminaries 

3.1. (EG)-(H) conditions. To clarify our approach to the MDP analysis, 
let us first demonstrate its applicability under (EG)-(H) setting. 

The (EG) condition provides the geometric ergodicity of -Pj n ^ to the in¬ 
variant measure p uniformly in x in the total variation norm: there exist 
constants K > 0 and g £ (0,1) such that for any x S R d , 


\\P^-p\\ tv <Kg n , n> 1. 


The latter provides an existence of bounded function 

U(x) = H(x) + J2 P * n)H ( 3 - 1 ) ® 

n> 1 


solving the Poisson equation 

H(x) = H(x) + P X U. 


(3.2) 


Poissonl 


In view of the Markov property, a sequence (Ci)*>i of bounded random 
vectors with£j := U(Xf) — Px t _iU forms a martingale-differences relative 
to the filtration generated by Markov chain. Hence, M n = Ci is th e 

martingale with bounded increments. With the help of Poisson’s equation 
we get the following decomposition 


1 1 1 

— ]T H(Xi _0 = ~[U(x) - U(X n )\ +—M n . 

my- L ^ •ny- ny 


(3.3) 


PoissDec 


The boundedness of U provides a corrector negligibility in the MDP scale, 
that is, the families 5" and -ppM n share the same MDP. In view of that, 
suffice it to to establish the MDP for 
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Assume for a moment that £j’s are i-i.d. sequence of random vectors. 
Recall, EQ\ = 0 and denote B = E^iQ. Then, the Laplace transform for 

is: 

S’n(X) = (Ee^'&Y, A € M. d . (3.4) 

Under this setting, it is well known that — M n obeys the MDP if B is not 
singular matrix and 

lim n 2a ~ 1 log S’n(X) = \(X,BX), X € R d . 

n—>oo 2 

We adapt this method of MDP verification to our setting. Instead of B, we 
introduce matrices R(AQ_i), i > 1 with 

B(x) = P X JJU* — P X U(P X U)*. (3.5) 

The homogeneity of Markov chain and the definition of Q provide a.s. that 


E((iC\ x i-i) = B(Xi-!). 

Instead of the Laplace transform (HO) . we apply the stochastic exponential 
CC3, expressed via (fs, 


<&(A) = n^( e<A ’^ > l^~ 1 )> 

i=l 


which is not the Laplace transform itself. 

The Poisson equation 621 and its solution EB permit to transform (13.511 
into 

B{x) = H(x)H*(x) + J2 [h(x){P^HY + {pY ] H)H*] , 

n> 1 

that is, f Rd B(z)n(dz ) coincides with B from (11.31) . 


Now, we are in the position to formulate 

Puhalskii Theorem, [for more details, see G and [?].] Assume B from 
D is nonsingular matrix and for any e > 0, A G 


lim 
n— KX) 77 , 


2a 


— log P 


n 2a_1 log<f n (A) — ~(X,BX) 


> £ = —OO. 


(3.6) 


Then, the family M n , n > 1 possesses the MDP in the metric space 
(M d ,r) (r is the Euclidean metric ) with the rate of speed and rate 

function I(y) = \\\y\\ 2 B -i- 


Remark 2. The condition cut is verifiable with the help of 

1 1 i n 

log jP (“|H( A , [5(Xi_i) -R]A) > e) = -oo 


lim 


n —>oo n 


2a-1 


i=l 


lim log P (^ E flCi | 3 e n "“ ICil 


(3.7) 


n —>oo n 


2a—1 


> £ = —OO. 


i =1 


LapO 


B (x) 


2.7Puhll 


lid 
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The second condition in dimi is implied by the boundedness of |£j[’s. The 
first part in (1X71) is known as Dembo’s conditions, [?], formulated as follows: 
for any e > 0, A £ M. d 


1 1 I 

I™ -logPf- V(A, [B(X l _ 1 )-B]\ 
n^oo n \n \ z —' \ 

i=l 


> £) < 0 . 


In order to verify the first condition in EZJ, we apply again the Poisson 
equation technique. Set h(x) = (A, \B{x) — B~\ A) and notice that 



= 0. 


Then, the function u(x) = h{x) + X^n>i Px'h is well defined and solves the 
Poisson equation u(x) = h(x) + P x u. Similarly to (IThi) , we have 


n 


u(x) - u(X n ) 
n 


+ 


m n 

' 

n 


where m n = " = i z % is the martingale with bounded martingale-differences 

(zj)j>i. Since u is bounded, the first condition in (IT71) is reduced to 


lim log P(\m n \ > ns) = —oo 

n— kx) Tl za 1 v J 


(3.8) 


3.13a 


while (13.81) is provided by Theorem lA.il in Appendix which states that (13.81) 
holds for any martingale with bounded increments. 

sec+3.1.1 

3.1.1. Singular B. The conditions from m remain to hold whether B is 
nonsingular or singular. For singular B the Puhalskii theorem is no longer 
valid. With singular B, we use the Puhalskii theorem as helpful tool 

It is well known that the family n > 1 obeys the MDP with the rate 
of speed and some rate function,say I(y) provided that 


lim lim log P 


C—»oo n— >oo n 

lim lim 


Mr, 


n u 


> C = -oo 


e—>0 n—>oo n 


lim lim 


2 OL 


— logP 


£—>0 n— KX) Tl 


2a 


— log p 


Mn 

n a 

M„ 


n L 


< £) < -l(y) 

<e)> 


(3.9) 2.21singl 


The first condition in drill) provides the exponential tightness in the metric 
r while the next others the local MDP. ^ 

In order to verify of we introduce “regularized” family , n > 1 

with 

n 

M£ = M n + v^5>i’ 

i= 1 

where (3 is a positive parameter and ('i?i)i>i is a sequence of zero mean i.i.d. 
Gaussian random vectors with cov(i?i,$i) =: I (I is the unit matrix). The 
Markov chain and (i?i)i>i are assumed to be independent objects. 
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It is clear that for this setting the matrix P is transformed into a positive 
definite matrix Bp = B + (31. Now, the Puhalskii theorem is applicable and 
guarantees the MDP with the same rate of speed and the rate function 

1 „ 


W) = 




We use now the well known fact (see, e.g. Puhalskii, HI) that MDP provides 
the exponentially tightness and the the local MDP: 


lim lim 

C —>oo n —>oc n 


1 


2 a 


— logP 


Mn 


1 


logP 


lim lim _ . 

e —>0 n —>oo n Za 1 

lim lim 9 log P 


n —>oo 71 


n u 
Mn_ 
n a 

Ml 

n a 


y 


> CJ = -oo 

< e) < -ip(y) 

< e) > -Ip(y). 


Notice now that (EH) is implied by (IXTU1) if 

l|e» B®By = y 


linr/^(y) = 




oo, 


and 


lim lim 

/3—► 0 n —>oo n 


2 a 


— l°g P 


VP 


n u 


i =1 


otherwise 


> rj ) = —oo, V 77 > 0. 


(3.10) 


(3.11) 


(3.12) 


Let T be an orthogonal matrix transforming B to a diagonal form: diag(P) 
T*BT. Then, owing to 


2Ip(y) = y*(pi + B)~ l y = y*T((3I + diag (B^T'y, 
for y = B®By we have (recall that P®PP® = P®, see 1 ) 

2 Ip(y) = y*B®BT((3I + diag (P))" 1 ^ 

= y*B®TT*BT((3I + diag(P))" 1 T*y 
= y*B®T diag(P)(/37 + diag(P))“ 1 T*y 
y*P®T diag(P) diag((P))®T*y 

= y*P®Tdiag(P)T*T(diag(P))®T*y 
= y* B® BB®u = u* B®y = ||y||| ffi = 2/(y). 

If V + B®B y , liing^o 2 / 3 ( 2 /) = 00 . 

Thus, (13.111) holds true. 


Since (i?i)*>i is i-i.d. sequence of random vectors and entries of are 
i.i.d. (0,1)-Gaussian random variables, the verification of (13.121) is reduced 
to 

lim lim — 9 —-logPf P > —V.) = —00 (3.13) 

/3—*0n—>00 77, 2q: 1 & Pd/ ’ V ^ 

i =1 

where (£i)i>i is a sequence of i.i.d. (0,1)-Gaussian random variables, and 
it suffices to consider the case “+” only. By the Chernoff inequality with 


2.21songl 


2.23+1 


2.24endl 


Ibetal 
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A > 0, we find that 


p(X> 


> 


2—1 


C¥ 

nr] 

7 v 


) < exp ( — A 


A 2 

1 —==■ + n — 

VP 2 


while the choice of A = VJr, P roy ides 


sec+3.2 


1 


n 


2a -1 




2—1 


-> —oo. 


3.2. Virtual scenario. 

- (EG)-(H) are not assumed 

- the ergodicity of Markov chain is checked 

- H is chosen to hold ten. 


(1) Let (13.111 hold. Hence, the function U solves the Poisson equation and 
the decomposition from EH is valid with M n = YV=i Co where 


Ci = u(Xi ) - Px^u. 


Let 


EC,*Ci E const. 


E 


ICil 




< const. 


(2) With B(x) and B are defined in (13.51) and <Q> respectively, set 
h{x) = (\, [B{x) -B]\), A G R d . 

Let 

(i) u{x) = h(x ) + ^ n >i Px^h is well defined 

(ii) for Zi = u{Xi) - Px^u, 


Ezf < const. 


E 


I zV e n 


*N|X 1 

K*-2—1 


< const. 


(3) For any e > 0, let 

lirn ^_logP(|H(V n )| > n a e) = -oo 

n—> oo ,L v / 

lim log P(\u(X n )\ > n a e) = -oo. 

n— kx) v 7 


Notice that (EG)-(H) provide (l)-(3) and even if (EG)-(H) fail, (l)-(3) 
may fulfill. Moreover, (l)-(3) guarantee the validity for all steps of the 
proof given in Section liTTl 

Thus, an ergodic Markov chain, possessing (l)-(3), obeys the MDP. 

The proof of Theorems 12.11 and r rn follows this scenario. 
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sec+4 



lem-4.1 


4. The proof of Theorem 12.11 

4.1. Ergodic property. 

Lemma 4.1. Under Assumvtion \2. 1\ ( X n ) n >o possesses the unique proba¬ 
bility invariant measure p with f Rd \z\p(dz) < oo. 


Proof. Let v be a probability measure on with f Rd \x\v(dx) < oo and 
let a random vector Xq, distributed in the accordance to v, is independent 
°f (£n)n>i- We initialize the recursion, given in (12.11) . by Xq. Let now 
X n is generated by (12.11) . Then, p n (dz ) = f Rd P x n \dz)is(dx) defines the 
distribution of X n . 

We show that the family ( p n ) n >i is tight in the Levy-Prohorov metric: 

lim lim p n (\z\ > k) = 0. 

k —>oo n—*oo 

pi y | 

By the Chebyshev inequality, p n {\z\ > k) < k n . The tightness follows 
from sup n>1 E\X n \ < oo. Further, since By Assumption 12.II 

l*n| = 1/(0, ^n) + (/(*„-!,£„) - /(0,£ n ))| 

< |/(0,UI + l/(^n-l,U-/(0,U)| 

< |/(0,^)| + ^n-l| 

< \f(0,0)\+t\Zn\ + Q\X n -i\, 


the sequence {E\X n \) n >\ solves a recurrent inequality 

E\X n \ < |/(0,0)| +£E\^\ + e E\X n _ 1 \ 
subject to E\Xq\ = f Rd \x\n(dx)(< oo). Hence, we find that for any n > 1, 

|/(0,0)| +tE\h\ 


E\X n \ < E\Xq\ + 


1 ~ Q 


Thus, the family {p n } is tight, so that, by the Prohorov theorem, {p n } 
contains further subsequence {p n } converging, as n! oo, in the Levy- 
Prohorov metric to a limit p being a probability measure on R f Z for any 
bounded and continuous function g on W l 

lim f g(z)p n '(dz) = I g(z)p(dz). 
n'^oc J R d J R d 

Thence, for g(z) = L A \z\ and L > 0, it holds 

(L A \z\)p(dz) = lim E(L A |X n /|) < lim E\X n \ < oo 

1 n '—>oo n—> oo 

and, by the monotone convergence theorem, 

\z\p(dz ) < lim E\X n \ < oo. 


/ 

J R c 


/ 

Jr* 


The p is regarded now as a candidate to be the unique invariant measure. 
So, we shall verify 


g(x)p(dx) = / P x gp(dx). 

’■ Jr d. 
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for any nonnegative, bounded and continuous function g. For notational 
convenience, write X* and X", if Xo = % and Xo is distributed in the 
accordance with v. By Assumption 12.11 

^-X^KelX^-X^l, n> 1, 

that is, |X® — X”\ converges to zero exponentially fast as long as n —> oo. 
For any x E M d , the latter provides linv^oo Eg(X ®,) = f Rd g(x)g(dx). Since 
the Markov chain is homogeneous, we also find that 

lim Eg(X% +1 ) = [ g(z)g(dz). 
n'-> oo 

On the other hand, owing to Eg(Xf, +1 ) = EPx x ,g , the above relation is 
nothing but 



Finally, owing to P x g = Eg(f(x,f i)), the function P x g of argument x is 
bounded and continuous. Consequently, lim EPx x ,g = L d P x gg(dx). 

Assume g' is another invariant probability measure, g! g. Then, taking 
Xq and Xq , distributed in the accordance to g and g' respectively and 
independent of (£ n )n>i, we get two stationary Markov chains (X%) and 
(Xn ) defined on the same probability space as: 

*£ = /(*£-i,£n) 

X^ = f{X^ n ). 

By Assumption EHJ |X#-X# \ < qIX^^- X ^_ x |, i.e. lim | Xff-Xfr \ = 0. 

n —>oo 

Recall that both processes Xn and Xn are stationary with the marginal 
distributions g and g’ respectively. Hence, for any bounded and continuous 
function g : — > R, 


sec+4.2 


lem-2.la 


I g(x)g(dx)~ g(x)g'(dx) < E\g{X£) - s(X£')| 
R d J«. d 

that is, g = g'. 


0, 


□ 


4.2. The verification of (1). Let K be the Lipschitz constant for H. Then 
\H(x)\ < |iL(0)| + K\x\ and f Rd \H(z)\g(dz ) < oo. By (11.11) . EH(XH) = 0. 
Then, 

\EH{XZ)\ = \E(H(XZ)-H(X£)\ 

< Kg n E\x - X£| < K{ 1 + \x\) e n . 

Therefore, ^ n>1 \EH(X%\ < y^(l + |:r|). Consequently, the function U(x), 
given in ED, is well defined and solves the Poisson equation. 

Recall that Q = U(Xf) — Px i _ 1 U. 


Lemma 4.2. The function U{x) possesses the following properties: 

1) U(x) is Lipschitz continuous; 

2) P X (UU*) — P X U(P X U)* is bounded and Lipschitz continuous; 

3) For sufficiently small 5 > 0 and any i > 1 


E 


(|C/(X i )-P Yi _ 1 C/| 3 e' 


8\U(Xi)-P x . U\ 


X,- 


< const. 
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Proof. 1) Since by Assumption 12.II 


\x* - xf\ < 6 \x*_i - xCi\, \Xq - xf\ < |*' - x'% 

we have 

I U(x') - U{x ")| < | H(x') - H(x ")| + J2 E \ R (Xn) ~ H(X?)\ 


n> 1 


k . , 

< - x 1 -x"\ 


(4.1) 4.6pi 


1 ~ Q 


2) Recall (see (14.51) 1 

p x (uu*) - p x u(p x uy = b{x) 


and denote B pq (x), p,q = 1,... ,d the entries of matrix B(x). Also, denote 
by U p (x), p = 1 ,,d the entries of U (x). Since B(x) is nonnegative definite 
matrix, suffice it to show only that B pp (xy s are bounded functions. Denote 
F(z) the distribution function of £i. Taking into the consideration (14.11) and 
Assumption EU we get 


B 


pp 


(x) = E(u p (f{x,£ i)) - [ U p (f(x,z))dF(z )) 

v JR d 7 

(AT) 2 


^ m 2 p 
- (i-rf 2 


l£i - z\dF(z ) 


< 4 


(i - qY 


E I6I 2 <°o. 


The Lipschitz continuity of B pq (x) is proved similarly. Write 
B pq (x') — B pq (x") =: ab — cd, 


where 

a = E(u p (f(x l ,f 1 ))- f U q (f(x',z))dF(z)) 

v JR d 7 

b = E(u q {f(x',f 1 ))~ f U q {f(x , ,z)dF(z)) 

v JR d 7 

c = E(u p {f(x",f 1 ))- [ U q (f(x",z))dF(z)) 

v JR d 7 

d = E(u q (f(x",i i))- [ U q (f(x”,z))dF(z)). 

v JR d 7 

Now, applying ab — cd = a(b — d) + d{a — c) and taking into account (14.11) 
and Assumption 12.11 we find that |a|, |d| < and so 

I B pq (x') - B pq (x”)\ < ^ 2 A|gi[|x / - x"\. 


3) By (14.11) and Assumption 12.II 

KP 

\U(X l )-P Xi _ 1 U\< T —(E\^\ + \^\). 

L (.J 


□ 
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sec+4.3 


sec+4.4 


4.3. The verification of (2). The properties of B(x ) to be bounded and 
Lipschitz continuous provide the same properties for 

h{x) = (A, [B(x) - B] A). 

Hence (2) is provided by (1). 

4.4. The verification of (3). Since U and u are Lipschitz continuous, they 
possess the linear growth condition, e.g., \U(x)\ < (7(1 + |x|), 3(7 > 0. So, 
(3) is reduced to the verification of 


lim 7*~ logPf \X n \ > sn a ^\ = —oo, e > 0. 

n — >og u za 1 V / 

Due to Assumption 12 . II we have 

\Xn\ < \f(X n _i,£ n )\ < |/(o,f n )| +e\Xn-l\ 

< \f(0,0)\ + Q\x n ^\ + e\u 

Iterating this inequality with Ao = x we obtain 

n n 

\x n \ < e n M + l/(o, o)| £ 6 n ~ j + ^Q n - j \Cj 

3 = 1 3 = 1 

n —1 


(4.2) 


2.10a 


+ ,e 


l-g 




1=0 


Hence, (TOli is reduced to 
1 


n—1 


lim 

n—►oo 77, 


2 ct— 


T logP(E^I^I ^ = -°°- 

1=0 


(4.3) 


4.9m 


We verify (IO|l with the help of Chernoff’s inequality: with 5, involving in 
Assumption 12.21 and 7 = y/— 


n— 1 


l=o 

The i.i.d. property for //s provides 

^ e E"3o _ ^ e E/3o 1 ') , e : 'l6l < £ e E£L 0 7ed£i| _ _g e <5|6l < qq 

and we get 


1 


n 


2a- 


n —1 

T I«gP(X:^n- J | ^ 

1=0 


_d > n a e) < -n 1 ~ Q fe + 


log 


n^ a 1 n —>00 


* —00. 


sec-5 


5. The proof of Theorem 12.21 

The proof of this theorem differs from the proof of Theorem 12 .II only in 
some details concerning to (L.l). So, only these parts of the proof are given 
below. 
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sec-5.1 

5.1. Ergodic property and invariant measure. Introduce (£ n )n>i the 
independent copy of (£ n )n>i- Owing to 


X n = A n x + J2 An ~% 
2—1 


n— 1 

A n X + £ A'tn-i, 

i=0 


we introduce 


72 — 1 

X n = A n x + Y J A % 

i =0 


(5.1) 


and notice that the i.i.d. property of (£i)i>i provides ( X n ) n >o '= ( X n ) n >o. 
By Assumption 12.51 A n —» 0, n —> oo, exponentially fast. Particularly, 


^trace (yTcov(£i,£i)(A*)*) < oo, 
i =o 


4.2x 


sec-5.2 


sec-5.3 


so that lim X n = y^*l n a.s. and in L 2 norm. 

Thus, the invariant measure g is generated by the distribution function 

~ OO 

of Aoo. In addition, EHX^II 2 = trace (A* cov(£i, £i)(A 1 )*), so that 

i =o 

/ ||z|| 2 //(d2) < oo. 

J R d 

5.2. The verification of (1) and (2). Due to 

(x*-x*') = A{x*_ 1 -x*L 1 ), 

we have (X*' — X^') = A n (x' — x" ). Let us transform the matrix A into a 
Jordan form A = TJT^ 1 and notice that A n = TJ n T~ 1 . It is well known 
that the maximal absolute value of entries of J n is n|A| n , where |A| is the 
maximal absolute value among eigenvalues of A. Bv Assumption rOl |A| < 1. 
So, there exist K > 0 and g < 1 such that |A| < g. Then, entries A pq of A n 
are evaluated as: \A pq \ < Kg n . Hence, \X^ — X%”\ < Kg n \x' — x"\, n > 1, 
and the verification of (1), (2) is in the framework of Section [3J 


5.3. The verification of (3). As in Section 0 the verification of this prop¬ 
erty is reduced to 

lim —tz —-logPflAfjJ > en a 'j = —oo, e > 0. (5-2) 

n— >oo n za 1 V / 

In (lA2ll . we may replace X n by its copy X n defined in (EH). Notice also 
that 

oo 

I An | < \A n x\ + ^max|A*J|£|. 
i =o pq 

As was mentioned above, \A pq \ < Kg? for some K > 0 and g G (0,1). Hence, 
suffice it to verify 

lim 2 a i lQ g P ( Yl 0*l&l > en “) = _00 > £ > 0 

n—>oo Ti za 1 V z ' J 

2—0 

what be going on similarly to corresponding part of the proof in Section |3J 


2.lOaa 









14 


B. DELYON, A. JUDITSKY, AND R. LIPTSER 


sec-2.3 


6. Exotic example 

Let (X n ) n >o, X n S M and Xq = x, be Markov chain defined by the 
recurrent equation 


X n = X n _i - m 1 + £ n , 


( 6 . 1 ) 


exmpl 


where m is a positive parameter, (£ n ) is i.i.d. sequence of zero mean random 
variables with 

Ee 5 ^ < oo, for some 5 > 0, 

and let § = 0. 

Although the virtual scenario is not completely verifiable here we show 
that for 

H(x) = f 

\x\ 

the family (5“) n >i possesses the MDP provided that 


m > - log Ee s ^. 
o 

Indeed, by EU we have 

1 y~^ Xk-i _ 1 (X n — x ) ^ y-v 

n a ^ I AT-, I m n a n a ^ m 
k= 1 1 K 11 k =1 


( 6 . 2 ) 


mdel 


The family possesses the MDP with the rate of speed n ^ 2 “ ^ 

k =l n ~ 

and the rate function I(y) = ?y 2 . Then, the family (*S'“) n >i obeys the 

same MDP provided that ( Xr ^a X ) >1 is exponentially negligible family with 
the rate n~^ 2a ^ 1 ' 1 . This verification is reduced to 


By the Chernoff inequality \X n \ > n Q e) < e Sn0l£ Ee s l x ”l, that is (16.dll 
holds if sup Ee^ Xn \ < oo for some <5 > 0. We show that the latter holds true 

n> 1 

for 5 involved in (HE2J). a helpful tool for this verification is the inequality 

\z — mn < \ \z\ — ml. Write 
i DM — i 1 1 i 

Ee 5 l x "l = Ee^ Xn ^I(\X n ^i\ < m) + Ee 5 l x "l/(|X n _i| > m) 

< e 5m Ee s ^ + e- Sm Ee s ^ ll Ee slXn - 11 . 


lim — 7 . —-logP(|XJ > n a e ) = —oo, e > 0. 

n .—inn J- ' ’ 


(6.3) negl 


Set^ = e Sm Ee s ^ and g = e Sm Ee s ^' 1. By (16.211 . g < 1. Hence, V(x) = e 6 ^ 
is the Lyapunov function: P X V < gV(x) + i. Consequently, 

EV(X n ) < gEV(X n )+£, n> 1 

and so, sup n >! EV(X n ) < V(x) + 
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sec-7 


7. Statistical example 


An asymptotic analysis, given in this section, demonstrate the thesis 
“MDP instead of CLT”. 

Let 

X n = 0f(X n _ 1 ) + £ n , 

where 9 is a number and (£ n )n> l is i.i.d. sequence of of (0,1)-Gaussian 
random variables. We assume that |0[ < 1 and / is bounded continuously 
differentiable function with j f'(x)\ < 1. By Theorem 12.11 (X n ) is an ei'godic 
Markov chain and its invariant measure /jlq depends on parameter 9. Since 
is Gaussian random variables, pg, being a convolution of some measure with 
Gaussian one, possesses a density relative to dz. Then, assuming f 2 (x) > 0 
relative to Lebesgue measure, we have Bg = j R f 2 (z)n(dz) > 0. Under the 
above assumptions, 

n EILl /(*«-!)*« 
n E”=i/ 2 (^-i) 

is a strongly consistent estimate of 9 by sampling {Xi,...,X n }, that is, 
lirrij^oo 9 n = 9 a.s. Moreover, it is known its asymptotic in the CLT scale: 


\fn{9 - 9 n ) 


law 

- > 

n— kx) 


i 



-Gaussian r. v. 


Here, we give an asymptotic of 9 n in the MDP scale: for any a € Q, l), 


n l ~ a (9 - 9 n ) 


MDP 


n 


2 _ yf_\ 

2Bg)' 


7.1c 


Theorem 7.1. The family n 1 a {9 — 9 n ) obeys the MDP with the rate of 
speed and the rate function I(y) = . 


Proof. The use of 


n 


1 ~ a (9 9 ) 1 f (X{ 1 


„'E " =1 

and the law of large numbers, P- lim n _ >00 ^ Ya= i / 2 (^«- 1 ) = Bg, give a hint 
that that the theorem statement is valid provided that 

(i) for M n = E,:=i f(Xi- 1 )&, the family obeys the MDP 

with the rate of speed n 2 l-i and the rate function I(y) = ; 


(ii) for any e > 0, 
lim 


n—>oo Tl 


1 1 n 

i— 1 


> £ = —OO. 


Following to m and taking into account the setting, we notice that 

" A 2 


<f n (A) = exp ( J2 • 


1=1 


is the stochastic exponential related to Mn) ri ^ x - Consequently, m is 

reduced to (ii), that is, only (ii) is left to be verified. 

The verification of (ii) is in the framework of Theorem (EU>- The function 
H(x ) = f 2 (x) — Bg satisfies the assumptions of Theorem 12.11 Hence, the 
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family Yli=k H ( X k i)) n ^ 0O obeys the MDP with the rate of speed ^rr 


and the rate function 




v OO , B e = 0, y / 0, 
where, in accordance with OK, 

B e = [ H 2 (x)ne(dx) + 2Y^ [ H{x)P^ Hp e {dx). 


n>l ' 


In particular, 


1 


lim 2a-1 lQ g P 
n—>oo n ^ 1 


k =1 

Hence, for any C > 0, we find that 

1 i 1 n 

lim -^T lo g p ( 

n—>oo 77, 1 \| 77, z ^ 


. . —-i-C 2 e 2 , > 0 

> Cs) < <! 2B » 

otherwise. 


—oo, 


> £ 


k =1 


= lim logP 

n—>oo 72 za 1 


< I™ 2a _i log-P 

n—> oo 77, za: 1 


1 71 

-5>(w_o 

fc=i 

1 n 

-JX**-.) 


fc=l 


> n 1_a e 


> Ce 


< ^ 2 Bg 


Bq > 0, 


—oo otherwise C—>oo 


-> —oo. 


□ 


theo-A.1 


Appendix A. Exponentially integrable martingale-differences 
Let ( n = (Cn)n>i be a martingale-difference with respect to some filtration 

n 

JP = {^n)n> o and M n = Y1 Ci be the corresponding martingale. 

i— 1 

Theorem A.l. Assume that for sufficiently small positive 6 and any i > 1 


E{e 5 ^\^i-i) < const. 


(A.l) 


A.l 


Then for any a 6 (0.5,1) 


lim logP(|M n | > ne) = -oo. 

n—xx) 77, ZQ! 1 

Proof. Suffice it to prove lim ai-i log P( ± ML > ns) = —oo. We verify 
here only “+” only (the proof of is similar). 

For fixed positive A and sufficiently large n, let us introduce the stochastic 
exponential 

n. 


£ n (\) = HE(e x ^\L? i - 1 ). 


2—1 
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A direct verification shows that 


E ex p 


A M n 


n 


- log 


= l. 


We apply this equality for further ones 


1 > El(^M n > ne^j exp ^-— — log<f r 

> El i^M n > ne^ exp — log S n 
Due to E (A^|«^i_i) = 0 and HA.Ill , we find that 

n >. 

log^n(A) = ^ lQ g f 1 + E[e X " ~ 1 ~ A—|#i_i 


(A.2) 


i =1 


n 


< E {^ E ((Ci) 2 iA-.) + EEdCiiv^i^-i)} 


2=1 

< AT 


12n 2 

A 2 ^ 

2n + 6n 2 J ’ 


where K is some constant. This inequality, being incorporated into (EH, 
provides 

/ r A 2 a 3 t 

1 > El (M n > ne) exp ( \e — I\ 

If e < 3, taking A = enK ~ l , we find that 


.2 n 6n 2 J ) 


n 


1 , , £ 2 ro 2( ' 1 “1/1 e 

2 ^ti log-P(M n > ne) < - 


K 

Thus, the desired statement holds true. 


6/ n—>oo 


-> —oo 


□ 
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